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SEQUENCE LISTING 



1) GENERAL INFORMATION: 

(i) APPLICANT: Zsebo, Krisztina M. 

Bosselman, Robert A. 
Suggs, Sidney V. 
Martin, Francis H. 

(ii) TITLE OF INVENTION: Stem Cell Factor 

(iii) NUMBER OF SEQUENCES: 104 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & Borun 

(B) STREET : ^6 3 00 Sears Tower, 233 South Wacker Drive 

(C) CITY: Chi 

(D) STATE: Illinois 

(E) COUNTRY: Ttoited States of America 

(F) ZIP: 60606A6402 

(v) COMPUTER READABLE^ FORM : 

(A) MEDIUM TYPE : Wloppy disk 

(B) COMPUTER: IBM\PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION iJATA: 

(A) APPLICATION NUMBI 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DAI 

(A) APPLICATION NUMBER: >08/449,649 

(B) FILING DATE: 24~MAY-\995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07X982,255 

(B) FILING DATE: 25-NOV-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/58^,701 

(B) FILING DATE: 01-OCT-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/573,1 

(B) FILING DATE: 24 -AUG- 1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/537, 198> 

(B) FILING DATE: ll-JUN-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/422,383 

(B) FILING DATE: 16-OCT-1989 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Clough, David W. 

(B) REGISTRATION NUMBER: 36,107 

(C) REFERENCE /DOCKET NUMBER: 01017/32953A\ 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312/474-6300 

(B) TELEFAX: 312/474-0448 
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(C) TELEX: 
(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Glu Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp lie Thr 
1 5 10 15 

Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 
20 25 30 

Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
35 40 45 

Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser 
50 55 60 

Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie Asp Lys Leu Gly 
65 70 75 80 

Lys lie Val Asp Asp Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
85 90 95 

Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro 
100 105 110 

Glu Glu Phe Phe Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp 
115 120 125 

Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu 
130 135 140 

Gly Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu 
145 150 155 160 

Pro Pro Val Ala Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= Inosine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ACRTTYTTNG GNGCRTTYTC YTCCAT 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 11 & 14 

(D) OTHER INFORMATION: /mod_base 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
AARAAYTCYT CNGGNGTRAA RTT 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
GTYTCNGGYT TYTT 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 
ATGGARGARA AYGCCCCCAA RAAYGT 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 

CCNAAYGAYT AYATGWTMAC 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
GGNGGNARCA TRAANGGYTT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
ACCAKAARAT CTTYAAANCG ATC 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GTATTTTCAA TAGATCCATT GA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CCAACTATGT CGCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GTAGTCAAGC TGACTGATAA G 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TAACCAACAA TGACTAGGCA A 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
TTCCAGAGTC AGTGTC 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GCGAAGCTTG CCTTTCCTTA TGAAGAAGA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GCGCCGCGGT TACGGTGGTA ACATGAAGGG CTTTGTGA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 



GATAAATGCA AGTGATAATC C 



21 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCGGTCGACC CGCGGAACTT TAAGTCCATG CAACAC 36 
(2) INFORMATION FOR SEQ ID NO : 18 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CACCCGCGGT TATGCAACAG GGGGTAACAT AAATGG 36 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CACCCGCGGT TAGGCTGCAA CAGGGGGTAA CATAAA 36 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTTAATGTTG AAGAAACC 18 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 



GATGGTAGTA CAATTGTCAG AC 



22 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GTCTGACAAT TGTACTACCA TC 22 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CAATTTAGTG ACGTCTTTTA CA 22 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TTAGATGAGT TTTCTTTCAC GCAC 24 
(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AAATCATTCA AGAGCCCAGA ACCC 24 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
AACATCCATC CCGGGGAC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
CTGGCAATAT TTTAAGTCTC AAGAAGACC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GCGCCGCGGC TCCTATAGGT GCTAATTGG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CCTCACCACT GTTTGTGCTG GATCGCA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
GGTGTCTAGA CTTGTGTCTT CTTCATAAGG A 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCCCCCCHGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
TTTTTTTTTT TTTTTTTTGG 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
TTTTTTTTTT TTTTTTTTAG 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 





- 196- 



(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



TTCGGCCGAT CAGGCCCCCC CCCC 



24 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36; 
TTCGGCCGGA TAGGCCTTTT TTTTTTTTTT 30 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GGCCGGATAG GCCTCACNNN NNNT 24 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GGCCGGATAG GCCTCAC 17 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin (660 . . 773 , 1184.. 1246, 2053.. 2223, 2837.. 2993, 

3692. .3774) 
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(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: j oin ( 720 . . 773 , 1184.. 1246, 2053.. 2223, 2837.. 2993, 

3692 . .3774) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AAAGTATCTT TCTATTGGCG AAGGACATGT TTTCCCATAA GTGGTAAACA AACTGTCTGC 60 

ACATAATAAT TATCTTGCTG CCGTAAAGAT TAGGTTAAAT TCTGCCTTCG ATCTAAAAAC 12 0 

ACACCCTTCT GTCAATCCGA GGAGCAGTGT GCTAGTCTAG AGGTCTAAAT GAAGGCTCCT 180 

TTCACGGTTG TATTTCTGCT CCCCAAATTG TCCACATTTA AAAGGAGAGT GCTTCTTTTC 24 0 

AGCCTTAGGC TCTGAATTTC ATGCATTCCT CCATTTTCCG AGGTCCCCCC CAAGTGATAA 300 

TTCTGTTACA CGTTGCTACA AGTTCATCCC TAATTGCCGT CAAGAAACTG ACTGTAGAAG 360 

GCTTACCACA GACGTTGTAA CCGACAGTAA AGCCATTGAA AGAGTAATTC AAACAGGATG 420 

GAAGCCAGGA GTATTTTGTG GCTGTTGCTC TTTTTCTTTT CAGTTTGGTG AGAGCAGCTT 480 

GAATGCTTAA CATTTAAGCC ATCAGCTTAA AACAAAACAA AACAAAACAA AAAAAAACCC 54 0 

CGCTCTGGCA TATTTGCACT TAACACATAC GGTATAAGGT GTTACTGGTT TGCATAGTTC 600 

TGGATTTTTT TTTTTTAAAA ACTGATGGAC ACCAAGAAAT GTTTCTGTTC TTTGTTTAG 65 9 

ACT TGG ATT ATC ACT TGC ATT TAT CTT CAA CTG CTC CTA TTT AAT CCT 707 
Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 -15 -10 -5 

CTC GTC AAA ACT CAG GAG ATC TGC AGG AAT CCT GTG ACT GAT AAT GTA 755 
Leu Val Lys Thr Gin Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val 

15 10 

AAA GAC ATT ACA AAA CTG GTAAGTAAAG AATGATTTTG GCATCTATAA 803 
Lys Asp lie Thr Lys Leu 
15 

GTCTTCCCTG TGCTTGCTGA C C AC ATAGGT TCAGGGCACT CCCGACAGGA GTTCCCAGCT 863 

TTCTAAGATA AGGAATCACT GTACGAGTCT GAAGTGCTTC TTCTGGGCAA ATGGGAGATG 923 

CTTAGGTCAT GGAGGGTTTA TCTGTATAAC TGGCCCTTTG CACACCAACA AAGTGAC TGA 983 

CTGGCTTTTG CCTGTTACCT ACTGTCTCCA GTCCTGGGCA TGGTATATAC TTAGGCACCC 1043 

AAGATTGGAT TTACAACTCA AGCATTATAT ATTGGACAAC ACGGGGTATG AGATATTAAT 1103 

GATATGTCAG GTTGGATGGA TGAGTTTTCT CAAGAAATTC TCTTGTATTT ACTCACGTTT 1163 

TCATTTCTTG GTCTCTGTAG GTG GCG AAT CTT CCA AAT GAC TAT ATG ATA 1213 

Val Ala Asn Leu Pro Asn Asp Tyr Met lie 
20 25 

ACC CTC AAC TAT GTC GCC GGG ATG GAT GTT TTG GTATGTAGTC CACACACTTC 1266 
Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
30 35 

TGAGTTGCCT TTTAGTAGCT AATGGGTGAC CTGTGCTTAT TCACATTGAA GACATTATTT 1326 

GCTCTTTGTC GTTTTTAGAT GTTGACCTAT AATTTTTCCT TCAAGCTGCT GCTAAGATTA 1386 

TCAGTGAGCA TTTCAGTATG TGTTTTAAGC CTACTCATTA AAAGGAAATG GCTCATCTTA 1446 
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GACGTAGCAA 


CCGATGTTAA 


TTTTTCCCCA 


GGCATCTCTC 


AGAGGGACTT 


GAATGTTAAA 


1506 


ATCATGTTAA 


ATTTCCTCCT 


TGGCTATGTT 


ATTTCTCATG 


GCTATGTTAT 


TCCTATTCGT 


1566 


ATTTCATTTA 


AAGGGACGGA 


ATATTTATTG 


TATTTCTGAA 


CTTTTTCAGG 


CATGCATCCG 


1626 


fcrGTCTTTGAA 


TAAAACACTA 


AGACTCCTTC 


TAGTAATGTT 


TGTAATCCTG 


TCTGTATCGA 


1686 


ATGTCTTTGA 


AAACGCAGTG 


ACTAAGCCAT 


AAATAATCTT 


CCACAGAACG 


TCCAGTGGTT 


1746 


CATGAACTTT 


GTATGTGGGG 


GTGGGGCAAG 


AATTGTCTCA 


CTATTGGTCA 


AGGAAGAGAA 


1806 


GGTAAGGTAT 


GCAAGGGTGG 


TTTAATCTTC 


TTCCGTGGAA 


GGACAAAATC 


ATCTATCATT 


1866 


TCCTCTGATC 


TCTATGCATT 


TGTTTGTTTT 


GAACTGAATC 


TGACTTGAGC 


AAGAGTTGGC 


1926 


GTCCTGTGTT 


CTGAGGAAAC 


TCTTTGTCCT 


GCAGTCAGTG 


ACTAAAAGTG 


CTGAGAGATC 


1986 


TGAAGAGCAC 


TCTGAATCTG 


CCATATTTTT 


AATAGATGCT 


TTGTCTTCTC 


TTTGAATTTC 


2046 


TTCCAG CCT 


AGT CAT TGT TGG TTA CGA GAT ATG GTA ACA CAC 


TTA TCA 


2094 



40 45 50 

GTC AGC TTG ACT ACT CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC 2142 
Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly 
55 60 65 

TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GGG AAA ATA GTG GAT GAC 2190 
Leu Ser Asn Tyr Ser lie He Asp Lys Leu Gly Lys He Val Asp Asp 
70 75 80 85 

CTC GTG GCA TGT ATG GAA GAA AAT GCA CCT AAG GTAACTTGGT ATTCATCAGA 2243 
Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
90 95 



ATTATTTTTC 


TTATACTGAG 


CTCATGATGA 


GCAATTCACA 


ACCACTTGTA 


ATTCCAGCTC 


2303 


CAGAGGACAT 


TATCCCCTCT 


TTGGATGCCA 


TAGGAATCTG 


CTCTCAAATA 


TGTAGATACC 


2363 


ACCTCTGCCA 


CCTCAGCACA 


TACATACACA 


TAATTAAAAA 


ATAGAAACAT 


TAAAGGAGTT 


2423 


CCAATCAATC 


CTTATTCTTT 


TCTGTATTCA 


GTATGCCCAG 


ATGTAAATTC 


TAGGAATATG 


2483 


TTTTAAAGGC 


TAATTCTTAT 


TTTGTAATAA 


GCAGCTTTAA 


AATTCTTAAT 


TGTTTTTTCG 


2543 


GGTCACTTTA 


TTGTCCTATT 


GCCACGACAT 


TGTCCTGTCC 


CATTGTCTGT 


TATTCCTTCT 


2603 


GTTTTGTTTA 


TTGTTCCCTA 


GTTACTTTGA 


TCATGAGATT 


GACCTGTTAC 


CCGTTGTTAT 


2663 


TCTCTGTAGC 


CATTTTGAGT 


TGTGTCTATT 


AGAACAGCTG 


TTAAATTACT 


TGAATCATTG 


2723 


AGGACATAGT 


CAATAATCTA 


TTATGCTGAT 


CCAGTCAAGT 


CTATGAGTTA 


TTTGAAAACT 


2783 


AGAATCTTTG 


TTAATTATTT 


GTTTGCTTGT 


TTGTTTGTTT 


ATTATTTGTC 


TAG AAT 


2839 



Asn 

GTA AAA GAA TCA CTG AAG AAG CCA GAA ACT AGA AAC TTT ACT CCT GAA 2 887 

Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu 
100 105 110 

GAA TTC TTT AGT ATT TTC AAT AGA TCC ATT GAT GCC TTC AAG GAC TTC 293 5 

Glu Phe Phe Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe 
115 120 125 
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ATG GTG GCA TCT GAC ACT AGT GAT TGT GTG CTC TCT TCA ACA TTA GGT 2983 
Met val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu Gly 
130 135 140 145 

CCT GAG AAA G GTAAGGCTTT TAAGCATTTC TTGTTTAAAT GTACATAGAA 3 033 
Pro Glu Lys 

AGCCTGAACT TCTGTAAGCC TCTACTGCTG AATCAACTAA ATGTGTTGCT GTAGAAAGAA 3093 

CGTGTGGGTT TTTCTGATAA AAACAAAAAG CAAATATCAA TGACTACCAA TGATTATTAT 3153 

CTAGCTTGAG AGATATGCCC TAAGACAGCG ATTCTCGATA TTTCTAAATT AAAGAATTGT 3213 

GTGATGGTGG CTCACATATT TTCTAACTGT GATATTTGCC AGGAGAGTAG AATAATGTTA 3273 

TTCTTCATCC CCAGAATTCC TAAGATTTCA CGTCTCATGT CTTTTCCATA AGGTTCAAAC 3333 

TCTGAGACTT GAGTTCTGAG CCTCAGCAGG TCATTCTGAA TCCCCACTCT CCCCGAGCTG 33 93 

GGTCCCTATG GGGGAACTAA CTTCATTGCT TTCTTTTAAA ACATGACGAG TTACCAACAG 3453 

CTCCTCGCTA TTATAAACAT GTTCCTAAGC ATGTCTGTGC ATGCAATAAG CCTTCACTCT 3513 

ACAAGACAGT TATGGTGTAT CGCTTGACAA AACTGAGCAG CCAAGCTGAG TATGAAATAA 35 73 

TAATCTAGAC TTGGGAGGCA GACCCAGCAC CTACTGTGAT ATTGCACTTC GCCTTTGGGG 3633 

GACTCTATGA TTCAAAAGTT CACCATGTAA CACTGACACA TTATTGCTTT CTATTTAG AT 36 93 

Asp 

TCC AGA GTC AGT GTC AGA AAA CCA TTT ATG TTA CCC CCT GTT GCA GCC 3 741 
Ser Arg Val Ser Val Arg Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
150 155 160 165 

AGT TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGTAAGTACA CATATCTGAT 3794 
Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn 
170 175 

TTACTGCATG CATGGCTCCA AGTATCCTCT ATAGGAGTGT TGCATGGACT TAAAGTTTAT 3854 

AAATCACTAC TAATAATGCT GTTCTGTCAC TGTTATTCCT TGTATGGGCT TCCTGACAAT 3 914 

TAAATATCTG GTTTGTAGAA TCGGATCTCC TTAGAGGTTA AGATGACCAT GACAAAATTA 3974 

GGCCAATCAA CTTTCTGCGA AGGTTATTTT AAATAAGGCA CGAAATTAAT TGAAGGAAAA 4 034 

AAAAATACAA GCAAGGCCTT ATTTTGAATC ATGGTAGGCT TAAAATAGAC TTTGTGGAGA 4 094 

ATGTCCCTGA TCAAAGTGGA GTTTTCAGAT TTCAAGTGCA TGTGCTAACT CTCCACAATG 4154 

TCAAGGCTAT TTTCAGTTTT GTGTCTCCAT ATTTACTACT GCATGTTTGG AAATTTGCTG 4214 

ATGCTGTTAG ATTACCTAAG AATGTATGTT GAAGAAGAAT GGACTTCTTT CCCTAAAATT 4274 

TCTGTCCTCT TTGCCCAAGA ACCCACGTTC CTGGAAGACT ATCTTATTTT CATGTCTGTG 4334 

CAATGATCAT TATAAAGATT ATTGAATATA CTGGGAATAC TCTGGTTTCT GTTTTTACAG 4394 

ATTCATAATA GCTTATTCAG TCTTTAAAGA AAGTTCTCTG AAGTCCATGC TTTAGAATGT 4454 

TTCTCTATCA AAACTTGACC TGGACCTTAA ATAAAGCTAT ATTTAGTCTT TTTATCCCTG 4 514 

AAAAATATAT TTCACAGTGT AGACATTTGA TATACATCTA AGGGAAGGAT GCTGCCAGAA 4574 
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TGCTCGGGCT GGCAGTCTAC AAAGTCCACT GCTCTCAGGA TGGACTTCTG AAAGCGGAAA 4634 
TTGTGAACTG CATGCATATA ACATATCAGA TCCTCGAGC 4673 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 -15 -10 -5 

Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro Val Thr Asp Asn Val 

15 10 

Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met He 
15 20 25 

Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp 
30 35 40 

Leu Arg Asp Met Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu 
45 50 55 60 

Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser Asn Tyr Ser He He 
65 70 75 

Asp Lys Leu Gly Lys He Val Asp Asp Leu Val Ala Cys Met Glu Glu 
80 85 90 

Asn Ala Pro Lys Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg 
95 100 105 

Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe Asn Arg Ser He Asp 
HO 115 120 

Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu 
125 130 135 140 

Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg Val Ser Val Arg Lys 
145 150 155 

Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser 
160 165 170 

Ser Ser Ser Asn 
175 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 



52 



100 



148 



196 



244 
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(A) NAME /KEY : CDS 

(B) LOCATION: 26.. 844 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 101.. 844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

CTGGATCGCA GCGCTGCCTT TCCTT ATG AAG AAG ACA CAA ACT TGG ATT ATC 

Met Lys Lys Thr Gin Thr Trp He He 
-25 -20 

ACT TGC ATT TAT CTT CAA CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT 
Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys Thr 
"iS -10 -5 

CAG GAG ATC TGC AGG AAT CCT GTG ACT GAT AAT GTA AAA GAC ATT ACA 
Gin Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp He Thr 
1 5 10 15 

AAA CTG GTG GCG AAT CTT CCA AAT GAC TAT ATG ATA ACC CTC AAC TAT 
Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met He Thr Leu Asn Tyr 
20 25 30 

GTC GCC GGG ATG GAT GTT TTG CCT AGT CAT TGT TGG TTA CGA GAT ATG 
Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
35 40 45 

GTA ACA 5*° TTA TCA GTC AGC TTG ACT ACT CTT CTG GAC AAG TTT TCA 292 
Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser 
50 55 60 

AA T ATT TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GGG 34 0 

Asn He Ser Glu Gly Leu Ser Asn Tyr Ser He He Asp Lys Leu Gly 
65 7 0 75 so 

OS tT A v T ? GAT ^ CTC GTG GCA TGT ATG GAA GAA AAT GGA CCT AAG 388 
Lys He Val Asp Asp Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
85 9 o 95 

fJt 2?* TCA CTG AAG AAG CCA GAA ACT AGA AAC TTT ACT CCT 436 

Asn val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro 
100 10 5 no 

q AA GAA I TC AGT ATT TTC AAT AGA TCC ATT GAT GCC TTC AAG GAC 484 

Glu Glu Phe Phe Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp 
11S 120 12 5 

Ph° mI° S*? GCA o CT GAC ACT AGT GAT TGT GTG CTC TCT TCA ACA TTA 532 
Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu 

130 135 X40 

GGT CCT GAG AAA GAT TCC AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA 580 
Gly Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu 
14b 150 155 160 

CCC CCT GTT GCA GCC AGT TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT 628 
Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn 
16 5 170 175 

a™ ^ GCC GCA ^ TCC CCT GAA GAC CCA 0(30 CTA G AA TGG ACA GCA 676 
Arg Lys Ala Ala Lys Ser Pro Glu Asp Pro Gly Leu Gin Trp Thr Ala 

180 185 190 
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ATG GCA CTG CCG GCT CTC ATT TCG CTT GTA ATT GGC TTT GCT TTT GGA 724 
Met Ala Leu Pro Ala Leu lie Ser Leu Val lie Gly Phe Ala Phe Gly 
195 200 205 

GCC TTA TAC TGG AAG AAG AAA GAG TCA AGT CTT ACA AGG GCA GTT GAA 7 72 

Ala Leu Tyr Trp Lys Lys Lys Gin Ser Ser Leu Thr Arg Ala Val Glu 
210 215 220 

AAT ATA CAG ATT AAT GAA GAG GAT AAT GAG ATA AGT ATG TTG CAA CAG 82 0 

Asn lie Gin lie Asn Glu Glu Asp Asn Glu lie Ser Met Leu Gin Gin 
225 230 235 240 

AAA GAG AGA GAG TTT CAA GAG GTG TAATT 84 9 

Lys Glu Arg Glu Phe Gin Glu Val 
245 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Thr Gin Thr Trp He lie Thr Cys He Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro 
-5 15 

Val Thr Asp Asn Val Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Asn Asp Tyr Met He Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser Val Ser 
40 45 50 55 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp Leu Val 
75 80 85 

Ala Cys Met Glu Glu Asn Ala Pro Lys Asn Val Lys Glu Ser Leu Lys 
90 95 100 

Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
120 125 130 135 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ser Pro 
170 175 180 
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Glu Asp Pro Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu lie 
185 190 195 

Ser Leu Val He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
200 205 210 215 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
220 225 230 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
235 240 245 

Val 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: join ( 905 .. 1018 , 1914.. 1976, 2572.. 2742 3152 

. .3307, 3513. .3595) 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: join ( 965 .. 1018 , 1914.. 1976, 2572.. 2742 3152 

. .3307, 3513. .3595) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CACAAGTGAG TAGGGCGCGC CCGGGAGCTC CCAGGCTCTC CAGGAAAAAT CGCGCCCGGT 60 

GCCCCGGGGA AGCCGGCGCT CCCTGGGACT TGCAGCTGGG GCGTGCAGGG CTGTGCCTGC 120 

CGGGTGAGAT ACTACAAAGA TAAATCAGTT GCACAAGTTC TTGAAACTCT ACAGTGTAAT 180 

AAGGAAAAAT AAGTCATGCA TAAAAGCAAC TATAATACAT AATAGAAAAT GTTATTTTCA 240 

AGCCGATGTG TAGGTTATGT GTGTTCGAGA GAGAGAGAGA GAAGACAGAT TACTTTCTGC 300 

TAGGGTTCAA GAATGCCTTC CTGTTGGCTA AGGAAATATT TTCCTTAAGT GGCTAAAAAG 360 

CTGTGTTTCA AAATATTCTT TTGATGTCTC ACAAATTCAG TGGAATTCTC TTAGGTCTAA 420 

AAATATACAT CTCTCTCACT TTAACTTGGT GTGCTATTGT AGATTATTGG ATTAAAGCAC 480 

TGCTCAGGGA TTATGCTGCT TCTTGCCAAG CAGTCTACAT TTAAAGTAGA AATAAGATGT 540 

TTCTTTTGGT GCCATAAGGT ATACATTTTA TGCATTCTCT AGTTTTTAGA AGATACCCTA 600 

AGGGCTAAGT CTTTAACATG CTGCTACAAG TTTATTCCTA ATTGCCATTG GGAAATTGGC 660 

TGAAGAAAGT TTTTAACAAA AGTTAACAAT ATTGTCATTG AGAGAATAAT TCAAAATGGA 720 

TTTTAACTAA AAGCTTTTAA AAACTTTGGT GAGCATAGCT TGAATGCGTA ATATTTAATT 780 

GCATTTAAGC CAATAACATA TATTAGACTG GTCTTTTTGT GCATCAAGGC ATTAGATGTT 840 

AAAAGTTTGA ATGATTACAG ATCTTAACTG ATGATCACCA AGCAATTTTT CTGTTTTCAT 900 
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TTAG ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG CTG CTC CTA TTT AAT 
Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn 
" 20 -15 -10 



949 



CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT CGT GTG ACT AAT AAT 997 
Pro Leu val Lys Thr Glu Gly He Cys Arg Asn Arg Val Thr Asn Asn 

GTA AAA GAC GTC ACT AAA TTG GTAAGTAAGG AATGCTTTAC CGTGCTGTGT 
Val Lys Asp Val Thr Lys Leu 
15 



1048 

1108 
1168 
1228 
1288 
1348 
1408 
1468 
1528 
1588 
1648 
1708 
1768 
1828 



AAAAAAGAGC TGTGGCTCTT TTTCCTGTGC TTGTTGATAA AAGATTTAGA TTTTTCTTGC 

CCCAAAGTAA TGTTTTCCTA AAGTGGGGAA AGTAATCACT GGGTTACAAT AAAGGGTTTA 

TAGAAAGCAG GTAGTGAGAT ATTTAGGGTC ATGGATAATT TGTTGGTAAA ACTGGCTAGT 

TGCACACCAC TGCTGTGACT GCTTCTTTGC TGGTCTTCTC CCCATCCTTC ATAGGCAGTG 

AAGGACCTTG GAGAGTTCGC TGTGTGCTGA TGGGCTTGCC CCAGCTTGTT CCCCATAATC 

TCTCCAGTGG GTTTCCCAGC ATGTTCTATT CCCCTTCACA TGTCTTCCTA CTCTTCTTTA 

AAAAGCCTAA CGAAAGGAAA TCTGAAATGG CTATTCTCCC AATTCAATCA GCAGGAAGAC 

CCTGTCACAT GTCAGTGGGT GTTTGCTCCT TCAGGGAACA TAGAGAGGTG ATTCATTGCC 

CACATGTTGA AGGGACTCAT CTCCCTGGTT TGTCACATTG AACTCTTCCC TCAGCGAAAG 

CATTTGCATT GCTTCCCGAA TTCCAAGATC ACAGGTGGAA GCTGAAATTC AGATCATGTT 

TCCAAAACTC AGTAGGTTAT ACCTAGCCAG GCATAACTGA ATTTGGAGTC TAAAAGATCT 

GTATTATCAC TTTTTTATTT TGAAGGATGC CTTTTGATTA CAGAGGGAAA TCAAGGATTA 

AAAATCAATA TACATGTAAA TATTGAAATT CATTGGTAAC TTTAAAAAGC ACAACAGTTT 

TGTGTGCTTT TCTCCAAAGC ACTACAAATA TGATTAATTG ATGTATAAGA ATTTTCTTAT 1888 

GGAATTTTTT TTTTTGTCTC TGTAG GTG GCA AAT CTT CCA AAA GAC TAC ATG 1940 

Val Ala Asn Leu Pro Lys Asp Tyr Met 
20 25 

ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT TTG GTATGTAAAC 1986 
He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
30 35 

TACATTTCTG AGTTTCATTT TAGTAGCTCA TAGAAGAAAT GGGATCATTC ATATTGAGAT 
AGTACAC TAG CTGCTATTTA GGAGCTTGCT TATTGTCAGG ATTTGAAGAA TTTATCTTTG 
GAATTTGACT TGCAGGCTTT TTTTTCCCCC TCTTCCTGTT ACAAGAGTCC CTCCTCCTAT 
TACAATAGTC CCTCCTCCTC CTGTCACACT AGTCCCTTCT CTTCCTGTTA CAATAACCCC 
TGTCCTCCTA TTACAACATT TTAAGTAATG TAATATTAAT TTTAAAAATC TGGCCAGGCA 
CGGTGGTTCA TGCTTGTAAT CCCAGCACAT TGGGAAGCTG AGACGGGTGG ATCATTTGAG 
GTCAGGAAGT TTGAGACAGC CTGGC CAACA TGGTGAAACT TCCTCTCTAC TAAAAATAAA 
AAAGTAGCCA GGCATGGTGG CAGGCACTTG TAATCTGAGC TACTCGAGAG GCTGAGGCAG 
GAGAATCACT TGAGTAACTA AAACGATAGC TTTGAAGAGT ACTCCGAGTT TTATGGCACT 



2046 
2106 
2166 
2226 
2286 
2346 
2406 
2466 
2526 
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TACTTATTAA AATAG CTGTT TTGTCTCTTT TTTCATATCT TGCAG CCA AGT CAT 2580 

Pro Ser His 
40 

TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC AGC TTG ACT GAT 262 8 

Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp 
45 50 55 

CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC 2676 
Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser 
60 65 70 

ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT GTG GAG TGC GTG 2724 
lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val Glu Cys Val 
75 80 85 90 

AAA GAA AAC TCA TCT AAG GTAACTTTGT GTTCATTGGG ATTATTTTTC 2772 
Lys Glu Asn Ser Ser Lys 
95 



ATTACGCTTC 


TCTAAAAACC 


CATGCTTCTT 


GGTGCTGTTG 


GGGAAAATGA 


GGCACCTTTA 


2832 


TTTATGATAT 


TTTGATTGTA 


TAAACTTCAA 


ATTTAAAAAT 


CTTGTTCAGA 


TGAGCAAAGA 


2892 


AAACAAGTAT 


TTGCAGTTAT 


ACTGCAATAC 


TGAAGTGCAC 


ATTCTTGTGT 


TCACTGCCCC 


2952 


AGATTCAACT 


TGTGATCCCA 


CTGGGATCAC 


TACCCTGCAT 


TACCAATCTG 


AATTACATAC 


3012 


GTTAAAACAG 


CCATCTAAAA 


GTGCTAGTTG 


TAAGAGTCTA 


AATACTTGAA 


TCTTTGAGAG 


3072 


ACATATTTAT 


AGTCCATTAT 


CTTCACCTCA 


GTTAAGT CTG 


AAGACTATTT 


GAAAAATGTA 


3132 


ATCCTATTTT 


TTCTTCTAG GAT CTA AAA 
Asp Leu Lys 


AAA TCA TTC AAG AGC CCA GAA CCC 
Lys Ser Phe Lys Ser Pro Glu Pro 


3184 



100 105 

AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC ATT 3232 
Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie 
110 115 120 

GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT GTG 3280 
Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val 
125 130 135 

GTT TCT TCA ACA TTA AGT CCT GAG AAA GGTAAGACAT GTAAGCATTT 3327 
Val Ser Ser Thr Leu Ser Pro Glu Lys 
140 145 

CCAGTTCAAA TGTAAACAAC AAACTTAAAT CTTCCCTATG TAGTAAGAAT CTACCTCTGT 3387 

GTTAAGCTGT AGCAAGATAC ATGCATGTAC GTCTAATAAA AAAGCAGATA TCAATAGCAC 3447 

AGAAGAAACT CTATAACTCA TACAAATCAC CATATAACAC TGACACATTA TTGCTTTC T A 3507 

TTTAG ATT CCA GAG TCA GTG TCA CAA AAC CAT TTA TGT TAC CCC CTG 3554 
lie Pro Glu Ser Val Ser Gin Asn His Leu Cys Tyr Pro Leu 
150 155 160 

TTG CAG CCA GCT CCC TTA GGA ATG ACA GCA GTA GCA GTA ATAGTAAGTA 3603 
Leu Gin Pro Ala Pro Leu Gly Met Thr Ala Val Ala Val 
165 170 175 

CATATATCTG ATTTAATGCA TGCATGGCTC CAATTAGCAC C TAT AGG AGT ATTGCATGGG 3663 

C TTTCAAGG A AACTTCTACA TTTATTATTA TTGATACTGT TCTGTTACTG TTATTCCTTT 3723 
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TATGGTCTTC TTGAGACTTA AGTTTGTAGA ATTAAATTTC CCTAGAGCTG GAGATAATGT 3783 
TTAGAGAATT AGGCCAATAA ATTT 380 7 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 -15 -10 -5 

Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg Val Thr Asn Asn Val 

15 10 

Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp Tyr Met lie 
15 20 25 

Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser His Cys Trp 
30 35 40 

lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu 
45 50 55 60 

Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie 
65 70 75 

Asp Lys Leu Val Asn lie Val Asp Asp Leu Val Glu Cys Val Lys Glu 
80 85 90 

Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg 
95 100 105 

Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp 
110 115 120 

Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val Val 
125 130 135 140 

Ser Ser Thr Leu Ser Pro Glu Lys lie Pro Glu Ser Val Ser Gin Asn 
145 150 155 

His Leu Cys Tyr Pro Leu Leu Gin Pro Ala Pro Leu Gly Met Thr Ala 
160 165 170 

Val Ala Val 
175 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
<ix) FEATURE: 
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(A) NAME/KEY: CDS 

(B) LOCATION: 17 . . 640 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 92.. 640 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

AAGCTTGCCT TTCCTT ATG AAG AAG ACA CAA ACT TGG ATT CTC ACT TGC 4 9 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys 
-25 -20 -15 

ATT TAT CTT CAG CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG 97 
lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly 
-10 -5 1 

ATC TGC AGG AAT CGT GTG ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG 145 
lie Cys Arg Asn Arg Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu 
5 10 15 

GTG GCA AAT CTT CCA AAA GAC TAC ATG ATA ACC CTC AAA TAT GTC CCC 193 
Val Ala Asn Leu Pro Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro 
20 25 30 

GGG ATG GAT GTT TTG CCA AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA 241 
Gly Met Asp Val Leu Pro Ser His Cys Trp lie Ser Glu Met Val Val 
35 40 45 50 

CAA TTG TCA GAC AGC TTG ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT 289 
Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie 
55 60 65 

TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA 337 
Ser Glu Gly Leu Ser Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie 
70 75 80 

GTG GAT GAC CTT GTG GAG TGC GTG AAA GAA AAC TCA TCT AAG GAT CTA 385 
Val Asp Asp Leu Val Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu 
85 90 95 

AAA AAA TCA TTC AAG AGC CCA GAA CCC AGG CTC TTT ACT CCT GAA GAA 433 
Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu 
100 105 110 

TTC TTT AGA ATT TTT AAT AGA TCC ATT GAT GCC TTC AAG GAC TTT GTA 481 
Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val 
115 120 125 130 

GTG GCA TCT GAA ACT AGT GAT TGT GTG GTT TCT TCA ACA TTA AGT CCT 52 9 

Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro 
135 140 145 

GAG AAA GAT TCC AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA CCC CCT 577 
Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro 
150 155 160 

GTT GCA GCC AGC TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGT AAG 62 5 

Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser Lys 
165 170 175 

TAC ATA TAT CTG ATT TAATGCATGC ATGGCTCCAA TTAGCACCTA TAGGAGTATT 68 0 

Tyr lie Tyr Leu lie 
180 
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GCATGGGCTT TCAAGGAAAC TTCTACATTT ATTATTATTG ATACTGTTCT GTTACTGTTA 74 0 

TTCCTTTTAT GGTCTTCTTG AGACTTAAGT TTGTAGAATT AAATTTCCCT AGAGC TGGAG 80 0 

ATAATGTTTA GAGAATTAGG 82 0 
(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
105 110 115 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser Lys Tyr lie Tyr Leu lie 
170 175 180 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join ( 565 579 , 1684.. 1797, 2693.. 2755, 3351.. 3521, 

3932.. 4088, 4314.. 4397, 4778.. 4887, 5208.. 5275, 
5677. . 5713) 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: j oin ( 1744 .. 1797 , 2693.. 2755, 3351.. 3521, 3932 

..4088, 4314.. 4397, 4778.. 4887, 5208.. 5275, 5677 
. .5713) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GAGCTCCGAG CCCTCTCTGG CGCGCGAGGT ATTTCGTCTG TNCCCGGGGG TGCCAGGTGA 60 

GCCCCAGCGG ATCCGGGAGG GTAAGCTGGG ACTCCTCGCG AGCAGTAGCT GCAGGGTACC 120 

AAGCTTCGCC CTCTGCGTCC CCGCGCCTTC GCGGTCTCCC GCCAGTGCAG GTCCGGGGCC 180 

CCCAGGCGAG CGGACAAGGT TGGCCTAATC TGCCAAACTT CTGGGGCATT TACCGTGCTC 240 

TGGCCGCCCT CCCGATTCTT CCCTCCGCGC CCTTGCCTGC TTCTCGCCTA CCCCGGGCTC 300 

CGGAAGGGAA GGAGGCGTGT CCGGAGCAGG CGGGCGGGAA CTGTATAAAA GCGCCGGCGG 360 

CTCAGCAGCC GGCTTCGCTC GCCGCCTCGC GCCGAGACTA GAAGCGCTGC GGGAAGCAGG 420 

GACAGTGGAG AGGGCGCTGC GCTCGGGCTA CCCAATGCGT GGACTATCTG CCGCCGCTGT 48 0 

TCGTGCAATA TGCTGGAGCT CCAGAACAGC TAAACGGAGT CGCCACACCA CTGTTTGTGC 54 0 

TGGATCGCAC CGCTGCCTTT CCTT ATG AAG AAG ACA CAA GTGAGTAGGG 58 9 

Met Lys Lys Thr Gin 
-25 

CGCGCCCGGG AGCTCCCAGG CTCTCCAGGA AAAATCGCGC CCGGTGCCCC GGGGAAGCCG 64 9 

GCGCTCCCTG GGACTTGCAG CTGGGGCGTG CAGGGCTGTG CCTGCCGGGT GAGACAAGAG 709 

GATGCGGGGG AGGCCGGCGT GGTGTGTGAT CCCGAGCCGA GCCGNNTGAG CCAGGGAGAA 769 

AAGGAGTGGG AGTACTGAGA GGGAGCCAGT GTCAAGTTTG GAGCCTCAGC AGTTAAGTTT 82 9 

TGAGCTGTCA GTCGGAAACC GTAATTCCCG TCTGGTGGAA AGATTGGCTT TTNGNCCACG 889 

GAATGTAAGT TATCACAGAT ACTACAAAGA TAAATCAGTT GCACAAGTTC TTGAAACTCT 94 9 

ACAGTGTAAT AAGGAAAAAT AAGTCATGCA TAAAAGCAAC TATAATACAT AATAGAAAAT 1009 

GTTATTTTCA AGCCGATGTG TAGGTTATGT GTGTTCGAGA GAGAGAGAGA GAAGACAGAT 1069 

TACTTTCTGC TAGGGTTCAA GAATGCCTTC CTGTTGGCTA AGGAAATATT TTCCTTAAGT 1129 

GGCTAAAAAG CTGTGTTTCA AAATATTCTT TTGATGTCTC ACAAATTCAG TGGAATTCTC 1189 

TTAGGTCTAA AAATATACAT CTCTCTCACT TTAACTTGGT GTGCTATTGT AGATTATTGG 1249 

ATTAAAGCAC TGCTCAGGGA TTATGCTGCT TCTTGCCAAG CAGTCTACAT TTAAAGTAGA 1309 

AATAAGATGT TTCTTTTGGT GCCATAAGGT ATACATTTTA TGCATTCTCT AGTTTTTAGA 1369 

AGATACCCTA AGGGCTAAGT CTTTAACATG CTGCTACAAG TTTATTCCTA ATTGCCATTG 142 9 

GGAAATTGGC TGAAGAAAGT TTTTAACAAA AGTTAACAAT ATTGTCATTG AGAGAATAAT 1489 
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TCAAAATGGA TTTTAACTAA AAGCTTTTAA AAACTTTGGT GAGCATAGCT TGAATGCGTA 1549 

ATATTTAATT GCATTTAAGC CAATAACATA TATTAGACTG GTCTTTTTGT GCATCAAGGC 1609 

ATTAGATGTT AAAAGTTTGA ATGATTACAG ATCTTAACTG ATGATCACCA AGCAATTTTT 1669 

CTGTTTTCAT TTAG ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG CTG CTC 1719 
Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu Leu 
-20 -15 -10 

CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT CGT GTG 1767 
Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg Val 
-5 15 

ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG GTAAGTAAGG AATGCTTTAC 1817 
Thr Asn Asn Val Lys Asp Val Thr Lys Leu 
10 15 

CGTGCTGTGT AAAAAAGAGC TGTGGCTCTT TTTCCTGTGC TTGTTGATAA AAGATTTAGA 1877 

TTTTTCTTGC CCCAAAGTAA TGTTTTCCTA AAGTGGGGAA AGTAATCACT GGGTTACAAT 1937 

AAAGGGTTTA TAGAAAGCAG GTAGTGAGAT ATTTAGGGTC ATGGATAATT TGTTGGTAAA 1997 

ACTGGCTAGT TGCACACCAC TGCTGTGACT GCTTCTTTGC TGGTCTTCTC CCCATCCTTC 2 057 

ATAGGCAGTG AAGGAC CTTG GAGAGTTCGC TGTGTGCTGA TGGGCTTGCC CCAGCTTGTT 2117 

CCCCATAATC TCTCCAGTGG GTTTCCCAGC ATGTTCTATT CCCCTTCACA TGT C TTCCT A 2177 

CTCTTCTTTA AAAAGCCTAA CGAAAGGAAA TCTGAAATGG CTATTCTCCC AATTCAATCA 2237 

GCAGGAAGAC CCTGTCACAT GTCAGTGGGT GTTTGCTCCT TCAGGGAACA TAGAGAGGTG 22 97 

ATTCATTGCC CACATGTTGA AGGGACTCAT CTCCCTGGTT TGTCACATTG AACTCTTCCC 2 357 

TCAGCGAAAG CATTTGCATT GCTTCCCGAA TTCCAAGATC ACAGGTGGAA GCTGAAATTC 2417 

AGATCATGTT TCCAAAACTC AGTAGGTTAT ACCTAGCCAG GCATAACTGA ATTTGGAGTC 2477 

TAAAAGATCT GTATTATCAC TTTTTTATTT TGAAGGATGC CTTTTGATTA CAGAGGGAAA 253 7 

TCAAGGATTA AAAATCAATA TACATGTAAA TATTGAAATT CATTGGTAAC TTTAAAAAGC 2597 

ACAACAGTTT TGTGTGCTTT TCTCCAAAGC ACTACAAATA TGATTAATTG ATGTATAAGA 2657 

ATTTTCTTAT GGAATTTTTT TTTTTGTCTC TGTAG GTG GCA AAT CTT CCA AAA 2710 

Val Ala Asn Leu Pro Lys 
20 

GAC TAC ATG ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT TTG 2755 
Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

GTATGTAAAC TACATTTCTG AGTTTCATTT TAGTAG CTC A TAGAAGAAAT GGG AT C ATTC 2815 

ATATTGAGAT AGTACACTAG CTGCTATTTA GGAGCTTGCT TATTGTCAGG ATTTGAAGAA 2875 

TTTATCTTTG GAATTTGACT TGCAGGCTTT TTTTTCCCCC TCTTCCTGTT ACAAGAGTCC 2935 

CTCCTCCTAT TACAATAGTC CCTCCTCCTC CTGTCACACT AGTCCCTTCT CTTCCTGTTA 2995 

CAATAACCCC TGTCCTCCTA TTACAACATT TTAAGTAATG TAATATTAAT TTTAAAAATC 3055 

TGGCCAGGCA CGGTGGTTCA TGCTTGTAAT CCCAGCACAT TGGGAAGCTG AGACGGGTGG 3115 
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ATCATTTGAG GTCAGGAAGT TTGAGACAGC CTGGCCAACA TGGTGAAACT TCCTCTCTAC 3175 

TAAAAATAAA AAAGTAGCCA GGCATGGTGG CAGGCACTTG TAATCTGAGC TACTCGAGAG 3235 

GCTGAGGCAG GAGAAT C ACT TGAGTAACTA AAACGATAGC TTTGAAGAGT ACTCCGAGTT 3295 

TTATGGCACT TACTTATTAA AATAGCTGTT TTGTCTCTTT TTTCATATCT TGCAG CCA 3353 

Pro 
40 

AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC AGC TTG 3401 
Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu 
45 50 55 

ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG AGT AAT 344 9 

Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn 
60 65 70 

TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT GTG GAG 3497 
Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val Glu 
75 80 85 

TGC GTG AAA GAA AAC TCA TCT AAG GTAAC TTTGT GTTCATTGGG ATTATTTTTC 3551 
Cys Val Lys Glu Asn Ser Ser Lys 
90 95 

ATTACGCTTC TCTAAAAACC CATGCTTCTT GGTGCTGTTG GGGAAAATGA GGC AC CTTT A 3611 

TTTATGATAT TTTGATTGTA TAAACTTCAA ATTTAAAAAT CTTGTTCAGA TGAGCAAAGA 3671 

AAACAAGTAT TTGCAGTTAT ACTGCAATAC TGAAGTGCAC TATTCTTGTG TTCACTGCCC 3 731 

CAGATTCAAC TTGTGATCCC ACTGGGATCA CTACCCTGCA TTACCAATCT GAATTACATA 3791 

CGTTAAAACA GCCATCTAAA AGTGCTAGTT GTAAGAGTCT AAATACTTGA ATCTTTGAGA 3851 

GACATATTTA TAGTCCATTA TCTTCACCTC AGTTAAGTCT GAAGACTATT TGAAAAATGT 3 911 

AATCCTATTT TTTCTTCTAG GAT CTA AAA AAA TCA TTC AAG AGC CCA GAA 3961 

Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu 
100 105 

CCC AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC 4 009 

Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser 
110 115 120 

ATT GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT 4 057 

lie Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys 
125 130 135 

GTG GTT TCT TCA ACA TTA AGT CCT GAG AAA G GTAAGACATG TAAGCATTTC 4108 
Val Val Ser Ser Thr Leu Ser Pro Glu Lys 
140 145 

CAGTTCAAAT GTAAACAACA AACTTAAATC TTCCCTATGT AGTAAGAATC TACCTCTGTG 4168 

TTAAGCTGTA GCAAGATACA TGCATGTACG TCTAAAAAAA AGCAGATATC AATAGCACAG 4228 

AAGAAACTAA TGATTGTAGA TTTGTGGGCT CTATAACTCA TACAAATCAC CATATAACAC 4288 

TGACACATTA TTGCTTTCTA TTTAG AT TCC AGA GTC AGT GTC ACA AAA CCA 4339 

Asp Ser Arg Val Ser Val Thr Lys Pro 
150 155 

TTT ATG TTA CCC CCT GTT GCA GCC AGC TCC CTT AGG AAT GAC AGC AGT 4387 
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Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser 
160 165 170 

AGC AGT AAT A GTAAGTACAT ATATCTGATT TAATGCATGC ATGGCTCCAA 4437 
Ser Ser Asn 
175 



TTAGCACCTA 


TAGGAGTATT 


GCATGGGCTT 


TCAAGGAAAC 


TTCTACATTT 


ATTATTATTG 


4497 


ATACTGTTCT 


GTTACTGTTA 


TTCCTTTTAT 


GGTCTTCTTG 


AGACTTAAGT 


TTGTAGAATT 


4557 


AAATTTCCCT 


AGAGCTGGAG 


ATAATGTTTA 


GAGAATTAGG 


CCAATAAATT 


TTCTGCTGAG 


4617 


GTTATTTTAA 


ATAAGACATA 


AAATTAATTT 


TAGAAATATG 


ATTTATGCCT 


TTTGTTGAAT 


4677 


CATTAACATA 


TATACAGAAA 


CAGTTAAAAC 


AAC C AC AGC A 


TAAGAGAAAA 


ACTTCTAGAA 


4737 


TGGATATGCT 


GTATTCATCA 


GTGTGTTCTT 


TAAATTATAG 


GG AAG GCC AAA AAT 


4791 



Arg Lys Ala Lys Asn 
180 

CCC CCT GGA GAC TCC AGC CTA CAC TGG CCA GCC ATG GCA TTG CCA GCA 4839 
Pro Pro Gly Asp Ser Ser Leu His Trp Pro Ala Met Ala Leu Pro Ala 
185 190 195 

TTG TTT TCT CTT ATA ATT GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG 4887 
Leu Phe Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys 
200 205 210 



GTAAGTGGTA 


CCATTCCTTT 


TTTTAAAAAT 


ATGCTATGTT 


TACATAAATT 


ATCATCTTTT 


4947 


TTTCCTCAAG 


AAATGATCCT 


TTAAGAAAAC 


AGTGAATCTA 


CCTTAGCTTA 


TACTAAACAA 


5007 


AATTTAAATT 


TTATAAAGTT 


TCCTGTTTCT 


CATTATGTCT 


GGAGACAATC 


CCTCTAGCTG 


5067 


ATAATTCACG 


CTTAAGAATT 


AGGAACTAAA 


ACTGTTATTG 


GAGTTATTGC 


CATAAAAGAT 


5127 


AAAAGTGGAG 


TCCACTTACC 


TCTTAAATAT 


TAGACCATTC 


ATTGATTATT 


TTACAGTATA 


5187 


TGTCTTTCTT 


CTTTTTCCAG 


AAG AGA CAG CCA AGT CTT ACA AGG GCA GTT 


5237 



Lys Arg Gin Pro Ser Leu Thr Arg Ala Val 
215 220 

GAA AAT ATA CAA ATT AAT GAA GAG GAT AAT GAG ATA AG GTATTTTGTT 5285 
Glu Asn lie Gin lie Asn Glu Glu Asp Asn Glu lie Ser 
225 230 235 

TTGC TAAATG TGTGCCCAAT CAAGCATGAC ATTGCCATTT CACACACTGT GTACCTGCCC 5345 

ATAATGTCTT TAAGAAGTCC TTCACTCATG ACAGTAGCTC CTAACCAGTG AGTCCCAACT 5405 

CTATCCATGT TTCTGATGTC TCACTCTCTC TTCGTATGTG TATATGCATA TACAGAGAAA 5465 

GAAATGTTTT AACTACTTGG AAAGACTACC TTAAGACAAA TGAAGTCTTC CCTCTTCCCT 5525 

ATAGTAATAA GAAGGTAGGC TCCCCCATTC AATTTTGCAA TCTTCTGCTA CTATATTTAC 5585 

AGAAAAGCTG CCTTTTACAA TGCCGAGATC ATGGTGTACC TCAGAATCTC TGACCAAGAG 5645 

CAAATAAGCA TTTTTTCTTA TTGTTTTTCA G T ATG TTG CAA GAG AAA GAG AGA 5698 

Met Leu Gin Glu Lys Glu Arg 
240 

GAG TTT CAA GAA GTG TAATTGTGGC TTGTATCAAC ACTGTTACTT TCGTACATTG 5753 
Glu Phe Gin Glu Val 
245 
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GTAAGTTTTT TTCTTCTTTC CTTTTTTTTT CTTTTTTTTA TTATACTTTA AGTTCTAGGG 5813 
TACATGTGCA CAATGTGCAG GTTTGTTACG TATGTTTACA TGTGCCATGT T 5864 
(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys He Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
170 175 180 

Gly Asp Ser Ser Leu His Trp Pro Ala Met Ala Leu Pro Ala Leu Phe 
185 190 195 

Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
200 205 210 215 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
220 225 230 

Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
235 240 245 
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Val 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNE S S : s ingl e 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
20 25 30 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val 
100 105 110 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
115 120 125 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
145 150 155 160 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
195 200 205 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe 
210 215 220 

Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
225 230 235 240 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 
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Val 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
20 25 30 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
100 105 110 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
115 120 125 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Ala Val Ala Ser Glu Thr 
145 150 155 160 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Thr 
195 200 205 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Phe Phe 
210 215 220 

Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
225 230 235 240 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
245 250 255 

Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 
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Val 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Lys Gly lie Cys Gly Lys Arg 
20 25 30 

Val Thr Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Lys lie Ala Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp lie Ser Val Met Val Glu Gin Leu Ser Val Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val 
100 105 110 

Glu Cys Thr Glu Gly Tyr Ser Phe Glu Asn Val Lys Lys Ala Pro Lys 
115 120 125 

Ser Pro Glu Leu Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Leu Glu Thr Val Ala Ser Lys 
145 150 155 160 

Ser Ser Glu Cys Val Val Ser Ser Thr Leu Ser Pro Asp Lys Asp Ser 
165 170 175 

Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser 
180 185 190 

Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ser Asn Ser 
195 200 205 

lie Gly Asp Ser Asn Leu Gin Trp Ala Ala Met Ala Leu Pro Ala Phe 
210 215 220 

Phe Ser Leu Val lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys 
225 230 235 240 

Lys Gin Pro Asn Leu Thr Arg Thr Val Glu Asn lie Gin lie Asn Glu 
245 250 255 

Glu Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin 
260 265 270 
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Glu Val 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Thr Gin Thr Trp lie Val Thr Cys lie Tyr Leu Gin Leu Leu 
15 10 15 

Phe Asn Pro Leu Val Lys Thr Lys Gly Leu Cys Arg Asn Arg Val Thr 
20 25 30 

Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp 
35 40 45 

Tyr Lys lie Ala Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser 
50 55 60 

His Cys Trp lie Ser Val Met Val Glu Gin Leu Ser Val Ser Leu Thr 
65 70 75 80 

Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr 
85 90 95 

Ser lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val Glu Cys 
100 105 110 

Val Glu Gly His Ser Ser Glu Asn Val Lys Lys Ser Ser Lys Ser Pro 
115 120 125 

Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg 
130 135 140 

Ser lie Asp Ala Phe Lys Asp Leu Glu Met Val Ala Ser Lys Thr Ser 
145 150 155 160 

Glu Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg Val 
165 170 175 

Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu 
180 185 190 

Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Thr Asn Pro lie Glu Asp 
195 200 205 

Ser Ser lie Gin Trp Ala Val Met Ala Leu Pro Ala Cys Phe Ser Leu 
210 215 220 

Val lie Gly Phe Ala Phe Gly Ala Phe Tyr Trp Lys Lys Lys Gin Pro 
225 230 235 240 

Asn Leu Thr Arg Thr Val Glu Asn lie Gin lie Asn Glu Glu Asp Asn 
245 250 255 

Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
260 265 270 



ft 
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INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val His Thr Gin Gly lie Cys Ser Asn Arg 
20 25 30 

Val Thr Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp lie Ser Glu Met Val Glu Gin Leu Ser Val Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Cys lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val 
100 105 110 

Glu Cys Met Glu His Ser Ser Glu Asn Val Lys Lys Ser Ser Lys Ser 
115 120 125 

Pro Glu Pro Arg Gin Phe Thr Pro Glu Lys Phe Phe Gly lie Phe Asn 
130 135 140 

Lys Ser lie Asp Ala Phe Lys Asp Leu Glu lie Val Ala Ser Lys Met 
145 150 155 160 

Ser Glu Cys Val lie Ser Ser Thr Ser Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ser Asn Ser lie 
195 200 205 

Glu Asp Ser Ser Leu Gin Trp Ala Ala Val Ala Leu Pro Ala Phe Phe 
210 215 220 

Ser Leu Val lie Gly Phe Ala Phe Gly Ala Phe Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Pro Asn Leu Thr Arg Thr Val Glu Asn Arg Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 
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(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gin Glu lie Cys Arg Asn Pro 
20 25 30 

Val Thr Asp Asn Val Lys Asp lie Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Asn Asp Tyr Met lie Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser Val Ser 
65 70 75 80 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp Leu Val 
100 105 HO 

Ala Cys Met Glu Glu Asn Ala Pro Leu Asn Val Lys Glu Ser Leu Lys 
115 120 125 

Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
145 150 155 160 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ser Pro 
195 200 205 

Glu Asp Pro Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu He 
210 215 220 

Ser Leu Val He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
245 250 255 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 
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(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Lys Glu lie Cys Gly Asn Pro 
20 25 30 

Val Thr Asp Asn Val Lys Asp lie Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Asn Asp Tyr Met lie Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp Leu Arg Asp Met Val lie Gin Leu Ser Leu Ser 
65 70 75 80 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser lie lie Asp Lys Leu Gly Lys lie Val Asp Asp Leu Val 
100 105 110 

Leu Cys Met Glu Glu Asn Ala Pro Lys Asn lie Lys Glu Ser Pro Lys 
115 120 125 

Arg Pro Glu Thr Arg Ser Phe Thr Pro Glu Glu Phe Phe Ser lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
145 150 155 160 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ala Pro 
195 200 205 

Glu Asp Ser Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu lie 
210 215 220 

Ser Leu Val lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
260 265 270 



Val 
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(2) INFORMATION FOR SEQ ID NO: 56; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Thr Trp lie lie Thr Cys Phe Cys Leu Gin Leu Leu Leu Leu Asn Pro 
15 10 15 

Leu Val Lys Ala Gin Ser Ser Cys Gly Asn Pro Val Thr Asp Asp Val 
20 25 30 

Asn Asp lie Ala Lys Leu Val Gly Asn Leu Pro Asn Asp Tyr Leu lie 
35 40 45 

Thr Leu Lys Tyr Val Pro Lys Met Asp Ser Leu Pro Asn His Cys Trp 
50 55 60 

Leu His Leu Met Val Pro Glu Phe Ser Arg Ser Leu His Asn Leu Leu 
65 70 75 80 

Gin Lys Phe Ser Asp lie Ser Asp Met Ser Asp Val Leu Ser Asn Tyr 
85 90 95 

Ser lie lie Asn Asn Leu Thr Arg lie 13 e Asn Asp Leu Met Ala Cys 
100 105 HO 

Leu Ala Phe Asp Lys Asn Lys Asp Phe lie Lys Glu Asn Gly Leu His 
115 120 125 

Tyr Glu Glu Asp Arg Phe lie Pro Glu Asn Phe Phe Arg Leu Phe Asn 
130 135 140 

Ser Thr lie Glu Val Tyr Lys Glu Phe Ala Asp Ser Leu Asp Lys Asn 
145 150 155 160 

Asp Cys lie Met Pro Ser Thr Val Glu Thr Pro Glu Asn Asp Ser Arg 
165 170 175 

Val Ala Val Thr Lys Thr lie Ser Phe Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser He Gly Ser Asn Thr Ser Ser Asn Ser Asn Lys 
195 200 205 

Glu Ala Leu Gly Phe He Ser Ser Ser Ser Leu Gin Gly He Ser He 
210 215 220 

Ala Leu Thr Ser Leu Leu Ser Leu Leu He Gly Phe He Leu Gly Ala 
225 230 235 240 

He Tyr Trp Lys Lys Thr His Pro Lys Ser Arg Pro Glu Ser Asn Glu 
245 250 255 

Thr He Gin Cys His Gly Cys Gin Glu Glu Asn Glu He Ser Met Leu 
260 265 270 

Gin Gin Lys Glu Lys Glu His Leu Gin Val 
275 280 
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(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gly lie Cys Arg Asn Arg Val 
20 25 30 

Thr Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp 
35 40 45 

Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser 
50 55 60 

His Cys Trp lie Ser Glu Met Val Glu Gin Leu Ser Val Ser Leu Thr 
65 70 75 80 

Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr 
85 90 95 

Ser lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val Glu Cys 
100 105 110 

Glu Glu Asn Ser Ser Lys Asn Val Lys Lys Ser Lys Ser Pro Glu Pro 
115 120 125 

Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie 
130 135 140 

Asp Ala Phe Lys Asp Phe Met Val Ala Ser Lys Thr Ser Asp Cys Val 
145 150 155 160 

Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg Val Ser Val Thr 
165 170 175 

Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp 
180 185 190 

Ser Ser Ser Ser Asn Arg Lys Ala Asn Glu Asp Ser Ser Leu Gin Trp 
195 200 205 

Ala Ala Met Ala Leu Pro Ala Leu Phe Ser Leu Val lie Gly Phe Ala 
210 215 220 

Phe Gly Ala Leu Tyr Trp Lys Lys Lys Gin Pro Ser Leu Thr Arg Ala 
225 230 235 240 

Val Glu Asn lie Gin lie Asn Glu Glu Asp Asn Glu lie Ser Met Leu 
245 250 255 

Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
260 265 

(2) INFORMATION FOR SEQ ID NO: 58: 



• 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin (1 . . 210 , 223.. 258) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GAA TTC TTC CGT ATC TTC AAC CGT TCC ATC GAC GCT TTC AAA GAC TTC 48 
Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe 
15 10 15 

GTT GTT GCT TCC GAA ACC TCC GAC TGC GTT GTT TCC TCC ACC CTG TCT 96 
Val Val Ala Ser Glu Thr Ser Asp Cys Val Yal Ser Ser Thr Leu Ser 
20 25 30 

CCG GAA AAA GAC TCC CGT GTT TCG GTT ACC AAA CCG TTC ATG CTG CCG 144 
Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro 
35 40 45 

CCG GTT GCT GCT TCC TCC CTG CGT AAC GAC TCC TCC TCC TCC AAC TCC 192 
Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser 
50 55 60 

AAA TAC ATC TAC CTG ATC TAATAGGATC CG GTT ACC AAA CCG TTC ATG 24 0 

Lys Tyr lie Tyr Leu lie Val Thr Lys Pro Phe Met 

65 70 75 

CTG CCG CCG GTT GCT GCT TAATAGGATC C 269 
Leu Pro Pro Val Ala Ala 
80 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe 
15 10 15 

Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser 
20 25 30 

Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro 
35 40 45 

Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser 
50 55 60 

Lys Tyr lie Tyr Leu lie Val Thr Lys Pro Phe Met Leu Pro Pro Val 
65 70 75 80 



Ala Ala 
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(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 184 . . 1002 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 259.. 1002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

CCGCCTCGCG CCGAGACTAG AAGCGCTGCG GGAAGCAGGG ACAGTGGAGA GGGCGCTGCG 60 

CTCGGGCTAC CCAATGCGTG GACTATCTGC CGCCGCTGTT CGTGCAATAT GCTGGAGCTC 120 

CAGAACAGCT AAACGGAGTC GCCACACCAC TGTTTGTGCT GGATCGCAGC GCTGCCTTTC 180 

CTT ATG AAG AAG ACA CAA ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG 228 
Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys He Tyr Leu Gin 
-25 -20 -15 

CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT 276 
Leu Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn 
-10 -5 15 

CGT GTG ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG GTG GCA AAT CTT 324 
Arg Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu 
10 15 20 

CCA AAA GAC TAC ATG ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT 372 
Pro Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val 
25 30 35 

TTG CCA AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC 420 
Leu Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp 
40 45 50 

AGC TTG ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG 468 
Ser Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu 
55 60 65 70 

AGT AAT TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT 516 
Ser Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu 
75 80 85 

GTG GAG TGC GTG AAA GAA AAC TCA TCT AAG GAT CTA AAA AAA TCA TTC 564 
Val Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe 
90 95 100 

AAG AGC CCA GAA CCC AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT 612 
Lys Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He 
105 110 115 

TTT AAT AGA TCC ATT GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA 660 
Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu 
120 125 130 



0 
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ACT AGT GAT TGT GTG GTT TCT TCA ACA TTA AGT CCT GAG AAA GAT TCC 708 
Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser 
135 140 145 150 

AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA CCC CCT GTT GCA GCC AGC 7 56 

Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser 
155 160 165 

TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGG AAG GCC AAA AAT CCC 804 
Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro 
170 175 180 

CCT GGA GAC TCC AGC CTA CAC TGG GCA GCC ATG GCA TTG CCA GCA TTG 852 
Pro Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu 
185 190 195 

TTT TCT CTT ATA ATT GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG AAG 900 
Phe Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys 
200 205 210 

AGA CAG CCA AGT CTT ACA AGG GCA GTT GAA AAT ATA CAA ATT AAT GAA 948 
Arg Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu 
215 220 225 230 

GAG GAT AAT GAG ATA AGT ATG TTG CAA GAG AAA GAG AGA GAG TTT CAA 996 
Glu Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin 
235 240 245 

GAA GTG TAATTGTGGC TTGTATCAAC ACTGTTACTT TCGTACATTG GCTGGTAACA 1052 
Glu Val 

GTTCATGTTT GCTTCATAAA TGAAGCAGCT TTAAACAAAT TCATATTCTG TCTGGAGTGA 1112 

CAGACCACAT CTTTATCTGT TCTTGCTACC CATGACTTTA TATGGATGAT T C AG AAATTG 1172 

GAACAGAATG TTTTACTGTG AAAC TGGC AC TGAATTAATC ATCTATAAAG AAGAACTTGC 1232 

ATGGAGCAGG ACTCTATTTT AAGGACTGCG GGACTTGGGT CTCATTTAGA ACTTGCAGCT 12 92 

GATGTTGGAA GAGAAAGCAC GTGTCTCAGA CTGCATGTAC CATTTGCATG GCTCCAGAAA 1352 

TGTCTAAATG CTGAAAAAAC ACCTAGCTTT ATTCTTCAGA TACAAACTGC AG 1404 



(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 



Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 
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Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
105 110 115 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
170 175 180 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe 
185 190 195 

Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
200 205 210 215 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
220 225 230 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
235 240 245 

Val 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 151.. 885 

(ix) FEATURE: 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 226.-885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
AGCAGGGACA GTGGAGAGGG CGCTGCGCTC GGGCT AC C C A ATGCGTGGAC TATCTGCCGC 60 
CGCTGTTCGT GCAATATGCT GGAGCTCCAG AACAGCTAAA CGGAGTCGCC ACACCACTGT 12 0 
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TTGTGCTGGA TCGCAGCGCT GCCTTTCCTT ATG AAG AAG ACA CAA ACT TGG ATT 174 

Met Lys Lys Thr Gin Thr Trp He 
-25 -20 

CTC ACT TGC ATT TAT CTT CAG CTG CTC CTA TTT AAT CCT CTC GTC AAA 222 
Leu Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys 
-15 -10 -5 

ACT GAA GGG ATC TGC AGG AAT CGT GTG ACT AAT AAT GTA AAA GAC GTC 270 
Thr Glu Gly He Cys Arg Asn Arg Val Thr Asn Asn Val Lys Asp Val 
15 10 15 

ACT AAA TTG GTG GCA AAT CTT CCA AAA GAC TAC ATG ATA ACC CTC AAA 318 
Thr Lys Leu Val Ala Asn Leu Pro Lys Asp Tyr Met He Thr Leu Lys 
20 25 30 

TAT GTC CCC GGG ATG GAT GTT TTG CCA AGT CAT TGT TGG ATA AGC GAG 366 
Tyr Val Pro Gly Met Asp Val Leu Pro Ser His Cys Trp He Ser Glu 
35 40 45 

ATG GTA GTA CAA TTG TCA GAC AGC TTG ACT GAT CTT CTG GAC AAG TTT 414 
Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu Asp Lys Phe 
50 55 60 

TCA AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT 462 
Ser Asn He Ser Glu Gly Leu Ser Asn Tyr Ser He He Asp Lys Leu 
65 70 75 

GTG AAT ATA GTG GAT GAC CTT GTG GAG TGC GTG AAA GAA AAC TCA TCT 510 
Val Asn He Val Asp Asp Leu Val Glu Cys Val Lys Glu Asn Ser Ser 
80 85 90 95 

AAG GAT CTA AAA AAA TCA TTC AAG AGC CCA GAA CCC AGG CTC TTT ACT 558 
Lys Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg Leu Phe Thr 
100 105 110 

CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC ATT GAT GCC TTC AAG 606 
Pro Glu Glu Phe Phe Arg He Phe Asn Arg Ser He Asp Ala Phe Lys 
115 120 125 

GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT GTG GTT TCT TCA ACA 654 
Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr 
130 135 140 

TTA AGT CCT GAG AAA GGG AAG GCC AAA AAT CCC CCT GGA GAC TCC AGC 702 
Leu Ser Pro Glu Lys Gly Lys Ala Lys Asn Pro Pro Gly Asp Ser Ser 
145 150 155 

CTA CAC TGG GCA GCC ATG GCA TTG CCA GCA TTG TTT TCT CTT ATA ATT 750 
Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe Ser Leu He He 
160 165 170 175 

GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG AAG AGA CAG CCA AGT CTT 798 
Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg Gin Pro Ser Leu 
180 185 190 

ACA AGG GCA GTT GAA AAT ATA CAA ATT AAT GAA GAG GAT AAT GAG ATA 846 
Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu Asp Asn Glu He 
195 200 205 

AGT ATG TTG CAA GAG AAA GAG AGA GAG TTT CAA GAA GTG TAATTGTGGC 895 
Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
210 215 220 

TTGTATCAAC ACTGTTACTT TCGTACATTG GCTGGTAACA GTTCATGTTT GCTTCATAAA 955 



• 
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TGAAGCAGCT TTAAACAAAT TCATATTCTG TCTGGAGTGA CAGACCACAT CTTTATCTGT 



1015 



TCTTGCTACC CATGACTTTA TATGGATGAT TCAGAAATTG GAACAGAATG TTTTACTGTG 



1075 



AAACTGGCAC TGA 



1088 



(2) INFORMATION FOR SEQ ID NO: 63: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Gly Lys Ala 
140 145 150 

Lys Asn Pro Pro Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu 
155 160 165 

Pro Ala Leu Phe Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr 
170 175 180 

Trp Lys Lys Arg Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin 
185 190 195 

He Asn Glu Glu Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg 
200 205 210 215 

Glu Phe Gin Glu Val 



220 
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(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glu Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp lie Thr 
15 10 15 

Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 
20 25 30 

Val Ala Gly Met Asp Val Leu Pro Ser His Xaa Trp Leu Arg Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

lie Thr Thr Leu Asn Tyr Val Ala Gly Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Xaa Xaa Leu Gly Pro 
15 10 15 

Glu Lys Asp Ser Arg Val Ser Val Xaa Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
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Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Glu Glu Asn Ala Pro Lys Asn Val Glu Ser Leu Lys Lys Pro Thr Arg 
15 10 15 

Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe Asp Arg Ser lie Asp 
20 25 30 

Ala 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Glu Ser Leu Lys Lys Pro Glu Thr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Val Ser Val Xaa Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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He Val Asp Asp Leu Val Ala Ala Met Glu Glu Asn Ala Pro Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 72 : 

Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe Xaa Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Leu Val Ala Asn Leu Pro Asn Asp Tyr Met He Thr Leu Asn Tyr Val 
15 10 15 

Ala Gly Asp Asp Val Leu Pro Ser His Cys Trp Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr Ser Asp 
15 10 15 

Cys Val Leu Ser Xaa Xaa Leu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
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Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe 
15 10 15 

Phe Ser lie Phe Xaa Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe 
15 10 15 

Phe Ser lie Phe Asp Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Asn Ala Pro Lys Asn Val Lys Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ser Arg Val Ser Val Xaa Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
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Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe 
15 10 15 

Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala 
20 25 30 

Ser Asp 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe 
15 10 15 

Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala 
20 25 30 

Ser Asp Thr Ser Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Leu Arg Asp Met Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu 
15 10 15 

Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie 
20 25 30 

Asp Lys Leu Gly Lys lie Val Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
15 10 15 



-234- 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
Pro Val Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
CCTGAGAAAG ATTC C AG AGTC 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 
CTGCAGTTTG TATCTGAAG 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
CATATAAAGT CATGGGTAG 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
ACTTGTGTCT TCTTCATAAG GAAAGGC 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
TGTACGAAAG TAACAGTGTT G 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 
ACTGCTCCTA TTTAATCCTC TC 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 
CACTGACTCT GGAATCTTTC TCA 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 
TCGACCCGGA TCCCC 

(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 



4 
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(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDEDNE S S : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
TCGAGGGGAT CCGGG 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TCTTCTTCAT GGCGGCGGCA AGCTT 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) S TRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Asp Ser Arg Val Ser Val Xaa Lys Pro Phe Phe Met Leu Pro Pro Val 
15 10 15 

Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Asp Ser Arg Val Ser Val Thr Lys Pro Phe Phe Met Leu Pro Pro Val 



1 
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10 



15 



Ala 



Ala 



(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 55 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 49 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 
TATGCAGGA 9 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GATCTC CTGC A 11 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TATGGAAGGT ATCTGCA " 
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(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) \ LENGTH: 11 base pairs 

(B) \TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENC^ DESCRIPTION: SEQ ID NO: 101; 
GATACCTTCC A 

(2) INFORMATION FOfe SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: iaucleic acid 

(C) STRANDEDifess : single 

(D) TOPOLOGX :\ linear 

(ii) MOLECULE TYPe\ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102; 
TTTCCTTATG 

(2) INFORMATION FOR SEQ iAnO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 babe pairs 

(B) TYPE: nucleic \acid 

(C) STRANDEDNESS: teingle 

(D) TOPOLOGY: line* 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: \sEQ ID NO: 103 
GCCGCCGCCA TG 

(2) INFORMATION FOR SEQ ID NO: 10-! 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 25 base pair^E 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ Id\nO:104 
TCTTCTTCAT GGCGGCGGCA AGCTT 
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